Analyzing the EGEE Production Grid Workload: Application to Jobs Submission Optimization
Abstract
Grid reliability remains an order of magnitude below that of clusters on production infrastructures. This work aims to improve grid application performance by improving the job submission system. A stochastic model capturing the behavior of a complex grid workload management system is proposed. To instantiate the model, detailed statistics are extracted from dense grid activity traces. The model is exploited in a simple job resubmission strategy. It provides quantitative inputs to improve job submission performance and enables quantifying the impact of faults and outliers on grid operations.
Similar resources
An experimental comparison of Grid5000 clusters and the EGEE grid
In this paper, we present a set of experiments comparing the EGEE production infrastructure and the Grid5000 experimental one. Our goal is to better understand and quantify how these systems behave under load. We first identify specific characteristics of the workload and data management systems of these two infrastructures, underlining some of their limitations and suggesting some improvements...
Deriving grid workload models from user submission strategies
Production-grid users experience many system faults as well as high and variable latencies due to the scale, complexity and sharing of such infrastructures. To improve performance, they adopt different submission strategies, that are potentially aggressive for the infrastructure. This work studies the impact of three different strategies. It is based on a probabilistic mo...
Analyzing the Workload of the South-East Federation of the EGEE Grid Infrastructure
Grids have emerged as wide-scale, distributed infrastructures providing enough resources for ever more demanding scientific experiments. EGEE is one of the largest scientific Grids in production operation today, with over 220 sites and more than 30,000 CPUs all over the world. A further evolution of EGEE needs to be based on knowledge of the deficiencies and bottlenecks of the current infrastructur...
CHARON System–Framework for Applications and Jobs Management in Grid Environment
We present a generic system for utilization of application programs in the EGEE Grid environment–the CHARON system. Charon was developed by the computational chemistry community in the Czech Republic to provide an easily manageable, comfortable, and modular environment that fulfills the specific requirements of computational chemistry application users. It currently offers an alternative to standard LCG/EGEE...
GATE Simulation for Medical Physics with Genius Web Portal
The PCSV team of the LPC laboratory in Clermont-Ferrand is involved in the deployment of biomedical applications on the grid architecture. One of these applications deals with the deployment of GATE (Geant4 Application for Tomographic Emission) for medical physics applications. The aim of the developments currently under way is to enable the usage of the GATE platform in clinical routine. However, th...